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Abstract 

The edit distance between two graphs on the same labeled vertex set is defined 
to be the size of the symmetric difference of the edge sets. The edit distance function 
of a hereditary property 'H is a function of p € [0,1] that measures, in the limit, the 
maximum normalized edit distance between a graph of density p and T-L. 

In this paper, we address the edit distance function for Forb(fl), where H = 
the power of the cycle of length h. For h > 2t{t + 1) + 1 and h not divisible 

by t + 1, we determine the function for all values of p. For h > 2t{t + 1) + 1 and 
h divisible by t + 1, the function is obtained for all but small values of p. We also 
obtain some results for smaller values of h. 
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1 Introduction 

The edit distance in graphs was introdnced independently by Axenovich, Kezdy, and 
Martin [2] and by Alon and Stav [1]. The qnestion considered is “Given a class of graphs 
R what is the minimnm nnmber m = m{n) snch that for every graph on n vertices, there 
is a set of m edge-additions and edge-deletions that ensnre the resnltant graph is a member 
of HT' For every hereditary property there is a p = p('H) snch that the Erdds-Renyi 
random graph G{n,p) is asymptotically extremal [1]. A hereditary property is a family of 
graphs that is closed nnder isomorphism and the taking of indnced snbgraphs. 

The edit distance fnnction of a hereditary property "H is a fnnction of p G [0,1] that 
measnres, in the limit, the maximnm normalized edit distance between a graph of density 
p and "H. A principal hereditary property, denoted Forb(i/), is a hereditary property that 
consists of the graphs with no indnced copy of a single graph H. Most of the known edit 
distance fnnctions are of the form Forb(if). These inclnde the cases where if is a split 
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graph [7] (including cliques and independent sets), complete bipartite graphs 7^2,4 [9] and 
[5] and cycles Ch where h is small [6]. In this paper, we compute the edit distance 
function for powers of cycles. 

For positive integers t and h, the power of a cycle of length h is denoted Cj^ and has 
vertex set {1,..., h}, where two vertices are adjacent in Cj^ if and only if their distance 
is at most t in Ch- 

The notation in this paper primarily comes from [8]. The edit distance between graphs 
G and G' on the same labeled vertex set is denoted dist(G, G') and satishes dist(G, G') = 
\E{G) A E{G')\. The edit distance between a graph G and a hereditary property Ti is 

dist(G',77) = min{dist(G,G') : V{G) = V{G'),G' e U}. 

The edit distance function of a hereditary property "H measures the maximum distance 
of a density p graph from Ti, i.e. 

ed^(p) = lim max{dist(G,77) : |I^(G)| = n, \E{G) \ = /{X)- 

Balogh and Martin [3] showed that this limit exists and is equal to lim E[dist(G(n, p) , 77)], 

n^oo 

using the result of Alon and Stav in [1], The function has a number of interesting prop¬ 
erties: 

Proposition 1 (Balogh-Martin [3]). 7/77 is a hereditary property, 7/iened^(p) is contin¬ 
uous and concave down over p G [0,1]. 

By the proposition above, the function ed^ achieves its maximum in [0,1]. We denote 
this maximum value by d'^, and the the set of all values of p for which the maximum is 
achieved by p'^. 

A colored regularity graph (CRG), K, is a complete graph with a partition of the 
vertices into white VW(77) and black VB(77), and a partition of the edges into white 
EW(77), gray EG (77), and black EB(77). We say that a graph 77 embeds in K, denoted 
77 I— )■ 77, if there is a function ip : V{H) —)■ V{K) so that if hih 2 G E{H), then either 
p{hi) = (p{h 2 ) G VB(77) or (p{hi)ip{h 2 ) G EB(77) U EG(77), and if T-iT-a ^ E{H), then 
either p{hi) = (^(hs) G VW(77) or p{hi)(p{h2) G EW(77) U EG(77). 

Given a hereditary property 77, it is easy to see that it can be expressed as 77 = 
f){Forb(77) : 77 G 7^(77)} for some family of graphs 7^(77). We denote /C(77) to be the 
subset of GRGs such that no forbidden graph embeds into them, i.e. /C(77) = {77 : 77 0- 
77,V77 G 7^(77)}. In our case, /C(77) = (77 : 77 0 77} for 77 = Forb(77) and so a GRG 77 
is a sub-CRG of 77 if 77 can be obtained by deleting vertices of 77. 

For every GRG 77 we associate a function g on [0,1] dehned by 

gxip) = min{x^Mi^(p)x ; x'^1 = 1, x > 0}, (1) 

where 

( p, if ViVj G EW(77) or Vi = vj G VW(77); 

[MK{p)]ij = < l-p, if ViVj G EB(77) or Vi = Vj G VB(77); (2) 

[ 0, iiviVj eEG{K). 
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The g function of CRGs can be used to compute the edit distance function. Balogh and 
Martin [3] proved that ed'^(p) = inf gxip) and Marchant and Thomason [5] further 

proved that the inhmum is achieved by some K, i.e. ed-^lp) = min gxip)- So, for every 

p G [0,1], there is a CRG K G /C('H) such that ed^(p) = gxip)- R is also shown in [5] 
that in order to hnd such GRG we only need to look at so called p-core GRGs. A GRG 
K is p-core if g^ip) < gxip) for every snb-GRG K of K. 

The GRG with r white vertices, s black vertices and all edges gray is denoted K{r, s). 
The clique spectrum of the hereditary property "H = Forb(iJ), denoted r('H), is the set of 
all pairs (r, s) snch that H ^ K{r-,s). It is easy to see that, for any hereditary property 
'H its cliqne spectrnm T = r('H) can be expressed as a Ferrers diagram. That is, if r > 1 
and (r, s) G F, then (r — l,s) G F and if s > 1 and (r, s) G F, then (r, s — 1) G F. An 
extreme point of a cliqne spectrnm F is a pair (r, s) G F snch that (r + 1, s) and (r, s + 1) 
do not belong to F. The set of all extreme points of F is denoted by F*. 

Dehne the function 7 ^(p) = mm{gK(r,s){p) ■ {r,s) G F('H)}. Glearly, edn{p) < 1 h{p)- 
Moreover, one only need consider the extreme points rather than whole F itself, that is, 
1h{.p) = min{pi^(^,,)(p) : (r, s) G T*{n)}. 

In this paper, the hereditary properties we consider are of the form "H = Forb(C'^). 
Since Theorem 20 in [8] gives edForb(x^)(p) = p/{h — 1), we will assnme that h > 2t + 2. 
For convenience, we denote ia = , for a G {0,... ,t}. We also denote po = 

The motivation for these values will be discussed in Section 5. 

The main resnlts of this paper are Theorems 2 and 3. 

Theorem 2. Let t > 1 and h > max{t(t + 1), 4} he integers. For all p G [0,1], 

p{l-p) 


-fn{p) = min 


Inip) = min 


e{o,i,...,i} [ a(l - p) + (4 - 1) P J ’ 

p p(l — p) 


ae{o,i,...,q [ t + 1 ’ a(l - p) + (4 - 1) P J 


if (t + 1) I h; 


Note: If a = 0, then . if , which we dehne to be ^ at p = 0. 

’ a(l-p)+(£o-l)p (fo-l)p’ ^0-1 ^ 

Theorem 3. Let t > 1 and h > 2t{t + 1) + 1 he positive integers and let Ti = Forb(G^). 
If (t + 1) / h, then for 0 < p < 1, and if (t + l)\h then for Po < P < 1, we have that 


ed^(p) = Inip)- 

Corollary 4. Let h > 5 he a positive integer and H = Forb(Cft). 

• If h is even, then for \h/3~\~^ < P < 1, 

p(l — p) 1 — p 


(3) 


ed^(p) = min 


i-p+(rv3i -i)p’ rv2i -ij ■ 
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• If h is odd, then for 0 < p < 1, 

. Jp p{I-p) 1-p \ 

'«<!’> ™“|2’i-p+(fV3i-i)p’rfc/2i-ir 

It was shown in [ 6 ] and in [5], respectively, that 

edForb(C 3 )(p) = P /2 and edForb(C 4 )(p) = P (1 - P)- 

It follows from this and the above corollary that when t = 1, the fnrthest graph from 
Forb(C/i) is a graph which has density p* = l/{\h/2\ — [h/3] + 1) when h > 4 and 
h ^ {4, 7, 8,10,16}, and has density p* = 1/(1 + \^\h/3~\ — 1) when h G {4, 7, 8,10,16}. 
Also, observe that the maximum value of the edit distance function can be an irrational 
number. 

Our proof techniques often require us to compare the g function of a CRG to one of 
the individual functions that are given in Theorem 2. However, when h is large enough 
at most 3 of these functions are necessary to dehne 7 ^. 

Corollary 5. Let t > 2 and h > At^ + lOt + 24 he positive integers. Reeall it = 
andpo = if^. Then 

• // (t + 1) I h, then for po < p <1, 

A ( \ • / P (1 -P) 1 - P 1 


• // (t + 1 ) / h, then for 0 < p < 1, 


ed-H(p) 


min 


p p{l — p) 1 — p 1 

t+ 1' t{i - p) + {it - i)p’ 4 - 1 J ’ 


The rest of the paper is organized as follows: Section 2 provides some definitions and 
basic results. Section 3 gives the proof of Theorem 2, Section 4 gives Lemma 13 which 
is the key lemma for the proof of Theorem 3, Section 5 gives the proof of Theorem 3, 
Section 6 gives the proofs of some helpful lemmas and facts, and Section 7 gives some 
concluding remarks. 


2 Definitions and Tools 

All graphs considered in this paper are simple. For standard graph theory notation please 
see [ 12 ], for the edit distance notation please see [ 8 ]. A sub-CRG K' of a GRG 77 is a 
component if it is maximal with respect to the property that, for all v,w E V{K'), there 
exists a path consisting of white and black edges entirely within K'. It is easy to compute 
the g function of a GRG given the g function of its components: 
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Proposition 6 ([6]). Let K he a CRG with components ^one? p G [0,1]. 

Then = EI=i (few(p))“^- 


-1 


Note that by Proposition 6 , gK{r,s){p) = [^ + T^J ■ 

Let iL be a CRG, v G V{K), and let x be an optimal solution to the quadratic 
program (1). The weight of n, denoted x(n), is the entry corresponding to v of the vector 
X. We say that w G V{K) is a gray neighbor of n G V{K) if w is adjacent to v via a gray 
edge. White and black neighbors are dehned analogously. The set of all gray neighbors of 
V is denoted by Nc{v) and the number of vertices adjacent to v via gray edges is denoted 
by degG(n), i.e. degG(n) = 

In contrast, the gray degree of n, denoted dciv), is the sum of the weights of gray 
neighbors of n, i.e. dciv) = ^ Similarly, the white degree of v, 

denoted dw{v), is the sum of the weights of the white neighbors of v plus the weight of v 
if and only if it is a white vertex. The black degree of v, denoted dsiv), is the sum of the 
weights of the black neighbors of v plus the weight of v if and only if it is a black vertex. 
So, dciy) + dw{v) + dsiy) = 1 for all n G R(iP). 

The number of common gray neighbors of vertices v and w is denoted by degg(n,tc). 
The gray codegree of vertices v and w, denoted dciv^w), is the sum of the weights of the 
common gray neighbors of v and w. For a set of vertices {^ 1 ,^ 2 ,..., we say V 1 V 2 ■ ■ - ve 
is a gray path if ViVi+i G EG(iL) for i = 1, ...,£— 1. Analogously, we say V 1 V 2 ■ ■ ■ viV\ is a 
gray cycle if ViVe G EG(iL) and ViVi+i G EG(iF) for i = 1, ...,£— 1. Proposition 7 gives 
a structural classiheation of p-core CRGs. 


Proposition 7 (Marchant-Thomason [5]). Let K he a p-core CRG. 
• If p = 1/2, then all of the edges of K are gray. 


• If p < 1/2, then EB(iF) = 0 and there are no white edges incident to white vertices. 


• Ifp > 1/2, then EW(A') = 0 and there are no black edges incident to black vertices. 

Proposition 8 gives a formula for dciv) for all v G V{K) and Proposition 9 uses this 
to give a bound on the weight of each v. 

Proposition 8 ([6]). Let p G (0,1) and K be a p-core CRG with optimal weight function 

X. 


If P < 1/2, then x(n) = gK{p)/{l — p) for all v G VW(iP), and 

dciv) = ^ ~ + ^^^x(t;), for all v G VB(iL). 


p 


p 


If P > 1/2, then x(n) = gKip)/p for all v G VB(A'), and 

dG{v) = ^~P~ 3 k{p) ^ ^ ^ vW(iP). 


1 — p 


1 — p 


5 



Proposition 9 ([6]). Let p G (0,1) and K be a p-eore CRG with optimal weight func¬ 
tion X. 

• If P < 1/2, then x(w) < gK{p)/{l — p) for all v E VB{K). 

• If p > 1/2, then x(w) < gK{p)/p for all v E VW(i^). 


3 Proof of Theorem 2: Computation of the 77 ^ func¬ 
tion 


In this section we compute the 7 -^ function, which gives an upper bound for the edit 
distance function. Recall that for any t > 1, h > 2t + 2 and a E {0,... ,t}, we denote 

Proof of Theorem 2. Our first observation is the value of the chromatic number of 
denoted x(0/). 

Proposition 10 (Prowse-Woodall [11]). Let t > 1 and h > max{t + 1,3} he positive 
integers. Let h = q(t -\- 1) -\- r, where r E {0,... ,t}. Then, x(0/) = t -\- \r/q~\ -\- 1. In 
particular, if h > max{t(t + 1),3}, then 


.vi'r'O — / ^ (f + 1) \ h; 

\t + 2, tf{t + l))fh. 

Let h > max{t(t + 1), 2t + 2} and y = x(0/). Denote the vertices of O/ by {1,..., h} 
such that distinct i and j are adjacent if and only if \i — j\ < t (mod h). For each 
a E {0, ...,t}, we hrst show that (a,^ —1) G F = F(Forb(0/)) and then show that 
(a, ia) ^ F. We will also show that if y > t + 1 then {(t + 1, 0),..., (y — 1, 0)} C F but 
that {t + 1,1) 0 F. 

This will imply that F* C {(a, 7 — 1) : a = 0,1,..., t}U{(y—1, 0)}, which is a stronger 
result than we need. 


Case 1: a G {0,..., t}. 

First, we show that (a, 7 — 1) G F. By contradiction, assume there is a partition of 
V(Cl) into a independent sets and ia — f cliques. Let k = ia — I, and let Ci,... ,Ck be 
the cliques. We may assume that the vertices in each Ci are consecutive. This is because 
if ji and j 2 are in the same clique, then by the nature of adjacency in the power of a cycle, 
every vertex between ji and j 2 is adjacent to every member of the clique, and hence can 
be added to the clique. Thus, \Ci\ < f + 1 for i = 1,..., /c. 

For i = 1,..., /c — 1, let Rj be the set of vertices between Ci and Cj+i, and let Bk 
be the set of vertices between Ck and Ci. The sets Bi might or might not be empty. If 
some |Rj| > a + 1, then the first a + 1 < t + 1 vertices form a clique and so must be in 
different independent sets, which is not possible since there are only a independent sets. 
Therefore, \Bi\ < a for i = 1,..., k. 
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Consequently, we need k{t + a + 1) > h m. order to cover with a independent sets 
and k cliques. Hence, /c > a contradiction to our choice of k. Thus (a, — 1) £ T for 

a = 0 ,..., f. 

Next, we show that {a, la) ^ T. Again, let /c = £„ — 1- For i = 1,..., /c, let A* = 
{(i — l)(t+ a +1) +1,.. ., i(t + a +1)} and let Sk+i = {1; • • •; ~ ^i=iSi- For i = 1,..., k, 

let Ci be the first t + 1 vertices of Si and let Ck+i be the first niin{t + 1 , lA^+il} vertices 
of S'fc+i. For 7 = 1,..., a, let A,- consist of the (t + l + 7 ')*'^ vertex of S'!,..., Sk and the 
(f + 1 + j)*’' vertex of Sk+i if \Sk+i\ >t + l+j. 

The sets (Ai,..., A^, Ci,..., Ck+i) form a partition of V(Cl). Clearly each Q, i = 
1,..., k, is a clique of size t + 1 and since there is a clique of size t + 1 between pairs of 
vertices in each A^, each Aj is an independent set. Thus (a, ^ F for a = 0,..., t. 


Case 2: a > t + 1. 

If (t+1) I h, then Proposition 10 gives that can be partitioned into t+1 independent 
sets and so (t+1, 0) 0 F. If (t+1) / h, then Proposition 10 gives that y > t+2 and since 
cannot be partitioned into fewer than y independent sets, we have (t+ 1 , 0 ),..., (+— 1 , 0 ) G 
F. Since can be partitioned into y independent sets, (y, 0) ^ F. 

Finally, let k = \h/{t + 1)] — 1. For j = 1,..., t + 1, let Aj = {(i — l)(t + 1) + j : t = 
1,..., k}. Let Co = {k{t + 1) + 1,..., h}. The sets (Ai,..., At+i, Co) form a partition of 
V(Cl). Clearly, Co is a clique of size at most t + 1 and since there are at least t vertices 
between pairs of vertices in each Aj, each Aj is an independent set. Thus (t + 1,1)^ F. 

Using Proposition 6, if h = g(t + 1) + r where r G {0,..., t}, then 


luip) 

7«(p) 


min 

aG{0,l,...,q 


min 

aG{0,l,...,q 


P(l-P) 1 
a(l -p) + (4 - l)pi ’ 

p p(l — p) 

t + \r/q] ’ a(l - p) + (4 - 1 ) P 


if r = 0 ; 
if r 7 ^ 0 . 


Restricting ourselves to h > min{t(t + 1),4}, we have the result in the statement of 
the theorem. □ 


4 Forbidden Cycles 

Before we can prove Theorem 3, we need to study the properties of the CRGs into which 
Cl does not embed. Recall that we may assume h > 2t + 2. An important property of 
such CRGs is that the set of lengths of gray cycles on black vertices is restricted, as is 
shown in Lemma 13. Its proof needs the technical inequalities in Facts 11 and 12. For 
completeness, we give their proofs in Section 6 . 

Fact 11. Let h,x,y be positive integers. Then 

(a) \h/x\ > y if and only if \_h/y\ > x. 

(b) \h/x) < y if and only if \h/y~\ < x. 
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Fact 12. Let t > 1, h > max{t(t — 1), 2t + 2}, and a G {0,..., t — 1} be positive integers. 

Then [^1 < [fj. 

Lemma 13 is a key lemma in proving our main result of Theorem 3. 

Lemma 13. Let p G (0,1/2] and let t > 1 and h > 2t + 2 he integers. Let K be a 
p-core CRG with exaetly a white vertices such that C/ 0- K. Let K he the sub-CRG of K 
induced by the set of all black vertices of K. Then, the following occurs: 

(a) If a E {0,..., t — 1} and h > G — t, then K has no gray cycle which has length in 

(b) If a = t, then |F(iL)| < — 1. 

(c) If a > t + 1, then {t + 1)){ h and V{K) = 0. 

Note: We interpret a gray cycle of length 2 to be a gray edge. 

Proof of Lemma 13. Denote the vertices of by {1,..., h} such that distinct i and j 
are adjacent if and only if |f — jj < t (mod h). 

Partition: Let K have a gray cycle on vertex set {ui,..., Vk\ such that is a gray 

edge, where the indices are taken modulo k. We describe a partition of V(C^), which 
gives an interval of forbidden gray cycle lengths. We will construct at most a independent 
sets and k cliques Ci,... ,Ck such that there is no edge between nonconsecutive cliques. 

Partition V (C/) into k sets of consecutive vertices Si,..., Sk, with each set Si of size 
either \h/k~\ or \_h/k\. We will eventually construct the at most a independent sets and 
k cliques Ci,... ,Ck with Ci C 0 such that there is no edge between 0 and Q/ unless 
\i — i'\ = 1 (mod k). 

If a = 0, then simply let C* = 0 for i = 1,... ,k. Using Fact 14, each Ci has size 
at least t and so nonconsecutive sets have no edge between them. Fact 14 is a simple 
observation of number theory. 

Fact 14. A set of size h can he partitioned into sets of size t or t + 1 if and only if 
h > t(t — 1). Moreover, for any k E {\h/(t + 1)~\,..., \_h/t\}, such a partition exists with 
exactly k parts. 

So, we assume a > 1 and choose a' G {\h/k\ —t, \h/kl\ — (t + 1)} such that 0 < a' < a. 
This is possible as long as both (a) 0 < \h/k\ — t and (b) \h/k'\ — (t + 1) < a. (This 
is only nontrivial ii k \ h, in which case at least one of the two choices of a' will be in 
0,..., a.) 

If a' = 0, again let U* = 0 for i = 1,..., /c. If a' > 1, let Aj consist of the vertex of 
each of S'!,..., Sk and let Ci = Si — Observe that if a' > 1, then \Si\ >t + l and 

so there are at least t vertices between each pair of vertices in every Aj. Therefore, Aj is 
an independent set for j = 1,..., a'. We have |Uj| < t +1 so U* is a clique for i = 1,..., k. 
In addition, \Ci\ >t and so there are no edges between Ci and U*/ unless \i — i'\ (mod k). 



The mapping, for all a > 0, is as follows: Map each Aj to a different white vertex 
and Ci to Vi for i = 1,..., k. If a = 0, Fact 14 gives that K has no cycle with length in 
{\hl{t + l)'\,...,[h/t\}. If a > 1, Fact 11, gives that K has no cycle with length in 

h 

t T o T 1 

and (4) is valid in the case of a = 0 also. 

Case (a). 

The result is given by (4). It suffices to 
this holds if h > — f. 

Case (h). 

In this case, we use a second partition. Partition V (C^) into k + 1 consecutive parts. 
Si,, S'fc+i, where k = \h/{2t + 1)] — 1 and r = h — {k — l){2t + 1). Since h > 2t + 2, 
k > 1. Let |S'i| = ■■■ = \Sk-i\ = 2t + 1, |S'fc| = \r/2'\ and \Sk+i\ = Note that 

t + 1 < |S'fc_|_i| < \Sk\ < 2t + 1. 

For j = 1,..., t, let Aj consist of the vertex in each part and let Ci = Si — 

Each of Ai,..., is an independent set. Furthermore, there are no edges between 
Ci and Cii \i i ^ . Therefore, K has at most k = \h/{2t + 1)] — 1 vertices; otherwise, 
Ai,..., At can be mapped arbitrarily to each of the t white vertices and Ci,..., Ck+i can 
be mapped arbitrarily to k + 1 different black vertices in K. 

Case (c). 

If (t + 1) I h, then x(C^) = t + 1 and K having at least t + 1 white vertices means 
that embeds in K, a contradiction. If (t + 1) / h, then partition V(Cl) into k = 
\h/(t + 1)J + 1 parts Si,..., Sk of consecutive vertices, each of S*!,..., Sk-i of size t + 1. 
For j = 1,..., t + 1, let Aj consist of the vertex in each Si for i = 1,..., k — 1. The 
graph induced by V(Cl) — Ujii forms a clique of size at most t in S'*,. Since all vertices 
in K are black, this clique will embed into any vertex of V{K). Thus V{K) = 0. □ 

5 Proof of Theorem 3: = 77 ^ 

We will use Lemma 13 to prove Theorem 3. Recall that h > 2t{t + 1) + 1 > + 1). By 

Proposition 10, this means x(C^) = t + 1 if (t + 1) \ h and x(C7 = t + 2 if (t + 1) / h. 

Proof of Theorem 3. By dehnition ed-^(p) < 'yuip) for all V G [0,1], so we need to show 
equality. 

Case 1: p G [1/2,1]. 

Fact 15 below establishes that 7 'h(p) = for P ^ [1/2, !]• 


(4) 


show that t+a+i \ — 1 ■ 1^ gives that 
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Fact 15. Let h and t be positive integers. If h > (t + 1)^ + 1, then 


-^ 

io — 1 t + 1 

For a G {1,..., t} if h > (t + l)(t + a) + 1, then for all p E [1/2,1], 

1 -P ^ Pjl-P) 
io-l ~ a{l-p) + {la - l)p' 

Note: The condition h > 2t{t + l) +1 suffices to achieve all of the conclusions in Fact 15. 

By Proposition 16 below, ed'^(p) = ’Juip) for the two values ofpG {1/2,1}. 

Proposition 16 (Balogh-Martin [3]). If H is a hereditary property, then ed-^(l/2) = 
7^(1/2). Moreover, if eH for all positive integers i, then ed^(l) = 7^(1) = 0 and if 
Ki eH for all positive integers I, then ed^(O) = 7^(0) = 0. 

We have ed^(p) < 7 ^(p) and the two functions are equal at p = 1/2 and at p = 1. 
The function 7^(p) is linear over p G [1/2,1] for h > 2t{t + 1) + 1. By Proposition 1, 
ed^^(p) is continuous and concave down, so we may conclude that ed^(p) = 7 ^(p) = 
for p G [1/2,1], This concludes Case 1. 

Note that Proposition 16 gives ed^(O) = 7^(0) = 0. Let p G (0,1/2) and ed^(p) = 
fi'x(p) for some p-core CRG K. Assume by contradiction that Px(p) < 1h{p)- Suppose K 
has a white vertices. Recall that for any t > 1, h > 2t + 2 and a E (0,..., t}, we denote 
= \ pjZTTi] • We consider several cases and show that we arrive at a contradiction in 
each case. 

Case 2: a>t and p E (0,1/2). 

If a > t + 1, then by Lemma 13/c/, V{K) = 0. As long as h > max{t(t + 1),3}, 
Proposition 10 gives that x(C/) < t + 2 with equality only if (t + 1) / h . Thus, a = t + 1 
and Proposition 6 gives that gp;{p) = p/{t + 1), a contradiction to the assumption that 
aM < 7i?(p)- 

If a = t, then Case (b) of Lemma 13 gives that \V{K)\ < It — I- Consequently, 
fi'ic(p) > We can partition K into t + 1 sub-CRGs, K and t white vertices, and use 
Proposition 6 to conclude that 


{gM) ' <tp ' + ) 

p(i -p) 


-1 


9kip) > 


t{l -p) + {It - I)P 
Hence, ed-u{p) > 'Jnip), again a contradiction. This concludes Case 2. 
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Case 3: a <t — 2 and p G (0,1/2). 

Recall that K is a. CRG with a white vertices, with 0 < a < t — 2. By Proposition 6, 
9kiP)~^ = ap~^ + dK^ip)- Therefore, 


9k{p) < 


( max 


a' — a 

-h 

P 


L'-i 

1 — p 


-1 

=: go{a,t-,p). 


(5) 


Given our assumptions on gxip), Lemma 17 gives lower bounds on the gray degree of 
vertices and the codegree of pairs of vertices. Recall that degg(n) denotes the number of 
gray neighbors of n G V{K). 


Lemma 17. Let p G (0,1/2), t > 1 be an integer and a E {0,... — 1}. Let po = it = 

l"^^! ■ Let K be a p-core CRG with all black vertices such that gK{p) < 5'o(o,t;p). 

Then 


(a) for every v G V{K), we have degg(n) > ia+i, and 

(b) for every v,w E V{K), 


degG{v,w) > 


ia+2, ifa<t-2; 

1, if a = t — 1 and p > po- 


Note: Since h > 2t + 2, it is the case that ia+i ^ 2 for a < t — 1 and ia +2 > 2 for 
a < t — 2. 

Now we consider the derived graph F with vertex set V{K) and edge set EG(iL). 
Using Lemma 17, the lower bound on the number of common gray neighbors of v and w 
gives a structural restriction on this graph. Note that the length of a path is dehned to 
be the number of vertices in said path. 

Lemma 18. Fix integers t > 1, h > ma,x{t(t — 1), 2f + 2} and a E {0,..., t — 1}. Recall 
that ia= \h/{t + a + 1)] and let L = \_h/t\. 

Let F be a graph with no cycle with length in {C, ■ ■ ■ ,L} and every pair of vertices 
either has at least ia +2 > 2 common neighbors if a < t — 2 or has at least 1 common 
neighbor if a = t — 1. 

Then F has no cycle of length more than ia — i- 

Now we consider a maximum-length path in the graph F. If such a path can be made 
into a cycle, then Proposition 19 gives that F must be Hamiltonian. By Lemma 18, this 
means that |U(iL)| < ia — i and, as such, gxip) > which is the g function for the 
GRG onia — l black vertices with all edges gray. This is a contradiction to our assumption 
in (5) by setting a' = a. Proposition 19 is a common argument in proofs of Hamiltonian 
cycle results, including the classical theorems of Dirac [4] and Ore [10]. 

Proposition 19. Let F be a connected graph. If some path of maximum length forms a 
cycle, then F is Hamiltonian. 
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So we may assume that every maximum-length path in F is not a cycle. Let Vi - ■ -v^ 
be such a maximum length path. The common neighbors of Vi and V(, in F must be on 
this path, otherwise F has a longer path. From Lemma 17, it follows that vi and vi have 
at least ^a +2 > 2 common neighbors on this path. However, Lemma 20 gives that there 
can only be one such neighbor, a contradiction. 

Lemma 20. Fix integers t > 1, h > 2t + 2 and a G {0,... — 1}. Reeall that ia = 

\h/(t + a + 1)~\. Let F be a graph with no eyele of length longer than ia — 1, with every 
vertex having degree at least ia+i > 2 and with every pair of vertiees having at least one 
eommon neighbor. Furthermore, let F have the property that no maximum length path 
forms a eyele. 

Let Vi - ■ -vg be a path of maximum length in F. Then Vi and vi have exaetly one 
eommon neighbor Vc on this path. Furthermore, N{vi) C {v 2 ,...,Vc} and N{vi) C 
{vc,...,ve}. 

This concludes Case 3. 

Case 4: a = t — 1 and p G \po, 1/2). 

Recall that K is a CRG with a = t — 1 white vertices. By Proposition 6, g~f}{p) = 
(t — l)p~^ + gx^ip). Therefore, 

/ \ / \ f (a' — {t — 1) ia' — i- 

9k{p) < 9o{t - l,t;p) = max - - + -- 

Again, we consider the graph F with vertex set V{K) and edge set EG(iL). By 
Lemma 17, every vertex in F has degree at least it and every pair of vertices has at 
least one common neighbor. By Lemma 18, F has no cycle of length more than it-i — 1. 
If there is a maximum-length path that is a cycle, then Proposition 19 gives that F is 
Hamiltonian, which means |P(iL)| < it-i — 1. As a result, gxip) > ^ contradiction. 

So we may assume that every maximum-length path in F is not a cycle. Let Vi.. .vt 
be such a maximum-length path such that, in K, the sum x(ui) -|- x(u£) is the largest 
among such paths. Let Vc be the unique common neighbor of Vi and vg. 

Let Vi have d neighbors in F. Since vi cannot have neighbors outside of this path, 
the sum of the weights, in K, of the neighbors of vi satisfy dcivi) < x(v 2 ) + • ■ ■ + x(xc). 
Notice that if vt G {ui,..., Uc-i} is a predecessor of a neighbor of vi, then it is an endpoint 
of a path containing the same i vertices, namely UjUj-i • • • UiUi+iUj +2 ■ ■ - Vc - ■ - v^. Hence 
all d predecessors of gray neighbors of Vi (including vi itself) have weight at most x(ui). 
All other vertices have weight at most Proposition 8 gives 

-—+ - —^x(ui) = x(ui) -F dcivi) < x(ui) H-h x{vc) < dx{vi) + (c - 

p p 1 — p 

Rearranging the terms, we obtain 

9k{p) —- + -) > 1 - x(ui) (d - -—. 

\l-p pj \ P J 
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Since p ^ < Pq^ = £t and it < d+1, we may, by Lemma 17, lower bound the right-hand 
side by using x(ni) < from Proposition 9, 



(y^) > 1. 


Lemma 18 bounds the size of the longest cycle, so c < it-i — 1. Thus, gxip) > > 

a contradiction. This concludes Case 4. 

Case 5: a = t — 1 and p G (0,po)- 

It remains to prove the theorem for 0 < p < po = ^he case where (t + l))fh and 

a = t — 1. 

Fact 21. Let h and t he positive integers such that h > 2t + 2. Let po = it^ = ^ 

and recall that 

. jp _ P(l-P) 1 

amtt} 11 + 1 ’ a(l - p) + (4 - l)p J ■ 

Then 7 ^(p) =p/{t + l) for p e [0,po]. 

We have ed^(p) < 7 ^(p) and the previous case gives that the two functions are equal 
at p = Po- They are also equal at p = 0. By Fact 21, the function 7 ^(p) is linear over 
p G [0,po] for h > 2t + 2. By Proposition 1, ed^(p) is continuous and concave down, so 
we may conclude that ed^(p) = 7 ^(p) = for p G [0,po]. 

This concludes Case 5 and completes the proof of Theorem 3. □ 


6 Proofs of Lemmas and Facts 

Proof of Corollary 5. The case of t = 1 is covered by Corollary 4. 
Let a G {1,..., t — 1}. 


If p > 
If p < 


a 

a + Iq — ia 
t — a 


t - a + ia - it' 


then , ' > 

a(l -p) + (ia - l)p 

then__> 

a(l-p) + («.-l)p- 


1 — P 

4-i‘ 

p(i -p) 

t(i -p) (it - i)p‘ 


Therefore, it suffices to show 


t 


> 


t — a ia — it a -\- it) — ia 

(io-ia)(t-a) > (ia-it)a. 


( 6 ) 
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To that end, 


(4 - 4)4 


a) - {ia 


it)a = (t — a)4 + — tia 


> 


> 


(t — a)h 
t + 1 


+ 


ah 

2t + l 


th 

t T n T 1 


at(t — a)h 

(t + l)(t + a + l)(2t + 1) 

t(t — l)h 

(t + l)( 2 t)( 2 t + l) “ ■ 


If h > 4t^ + lot + 12 + then ( 6 ) is satished and the corollary follows. 


□ 


Proof of Fact 11. We only need to prove one direction because x and y are arbitrary. In 
both cases, we will prove the forward implication. 

(a) Let \_h/x\ > y and h = qx r, where r G {0,...,a; — 1}. Then y < \h/x\ = q, so 
h > xy + r. Thus \_h/y\ > x + \r/y\ > x. 


(b) Let \h/x~\ < y and h = qx — r, where r G {0,. .., a; — 1}. Then y > \h/x^ = q, so 
h < yx — r. Thus {h/y] < x — [r/y\ < x. 

□ 


Proof of Fact 12. Clearly, if a G {0,..., t — 1}, then j"< I" 7 ^] so it suffices to prove 
this fact for a = 0. Let h = qt + r with r G {0,..., t — 1}. Since h > t(t — 1), we have 
g > t — 1 > r. Then 


h 

t + 1 


q + 


r — q 
t + 1 


< q 



□ 


Proof of Fact 15. If h > (t + 1)^ + 1, then t + 2 < \h/{t + V)\ = 4- Consequently, 

t +1 < 2^4 +1) < p(4 + 4 

so 

For a G {1,... ,t}, let h = q(t + 1) + r, where r G {1,... ,t + 1}. The bound 
h> (t + l)(t + a) + 1 ensures q >t + a. Then, 


Cl 


h 

t d \ 


— d -\- 

= q + 

< q + 


q{t + a + 1) + r — qa 
t T ct T 1 


a(t + a + 1) + r — qa 
t T n T 1 

a(t TnTl)TtTl — (f P a')a 
t T n T 1 


< g + 1 


h 

t + 1 
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If a < t — 2, then we choose a' = a + 2. Then degQ(n, w) > ia +2 — 1, and because 
degG-(n,w) is an integer, degg(n,tc) > ia+ 2 - 

If a = t — 1, then we choose a' = t. Then degg(n, w) > —+ 4 — 1 =4 > 0, 

since p > po = it^. Because degc.(n,t(;) is an integer, degc.(n,tc) > 1. 

□ 

Proof of Lemma 18. 

We say that a long cycle is a cycle of length at least L + 1 and will show that there are 
no long cycles. Let ui • ■ • be a smallest cycle in G among all those length greater than L. 
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Case 1: 0 < a < t — 2. 

Observe that this case requires t > 2. Consider the path on the cycle 

Vi ■ ■ -viVi. There is no cycle of length ia and so the common neighbors of vi and 
are all in {^ 2 , • • • Note that Lemma 17 establishes that vi and vi^-i have at least 

C +2 > 2 common neighbors. 

Since all common neighbors of Vi and are in {^ 2 ,..., ^^^- 2 }, we have C —3 > (ia+ 2 - 
Hence, 

h ^ r h 

tTnT3 tTnT3 

and so h> (t + a + l)(t + a + 3). 

This gives that the number of common neighbors of Vi and is at least ^a +2 = 

> t + a + 2 > 4. 

Therefore, vi and has at least two common neighbors in {^ 3 ,..., ^ 4 - 3 }. Let 

i > 2 and j < ia — 2 be, respectively, the smallest and largest indices of vertices in 
{^ 3 ,..., ^£^- 3 } that are common neighbors of Vi and That is, 3 < i < j < — 3. 

The cycle ViViVi^i ■ ■ ■ has length i — i + 2. The cycle V 1 V 2 ■ ■ ■ 

has length j + i — ia + 2. 

Since these two cycles have length less than £, they cannot be long cycles. Hence, their 
length is at most — 1, giving us 


< 


h 


t T n T 1 


3 < 


h 


t T n T 1 


i-i + 2 <ia -I 

i + 3-L + ‘l<ia-l. 

We can add these inequalities and use the fact that i > L + 1. Rearranging the terms, 
we conclude the following: 


34 - 2L - 7 > 34 - 2£ - 5 > J - i + 1 > 4+2 - 2. (7) 


To verify there are no long cycles, we must show that (7) produces a contradiction. 
Since 0 < a < t — 2, 


h 


< 


3 

t d \ 

Q 1 

( h 


1 



h 

, 9 

t Q/ 3 


L 

h ■ 



7 


h 


t T u T 3 


2/i(ot T T 4o T 3) 
t(t T u T l)(t T u T 3) 

2 = 4+2 - 2, 


a contradiction for all t > 2 and h > 2t + 2. Therefore, for 0<a<t — 2, G has no cycle 
of length longer than ia — 1. 
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Case 2: a = t — 1. 

Since all common neighbors of vi and are in {^ 2 ,..., n£j_i- 2 }, we have f't-i —3 > 

1. Hence, 


1 < - 3 


h 

Yt 



and so h > 6t, which means it-i > 4. 

Consider the path vi - ■ ■ on the cycle ni • • • v^vi. To see there is no cycle of length 
it-i + 1, we set h = q{2t) — r with q > 2 and r G {0,..., 2f — 1} and have 



'h' 


—r 


h 

1 

+ 

Yt 

+ l = q + l<2q-2<2q + 

t 

< 

t 


Since vi and have a common neighbor Vi, either viViVi+i ■ ■ ■ vevi or vi - ■ ■ ViV£^_^ve^_^+i ■ ■ ■ V£Vi 
has length less than i. Without loss of generality, we will assume that it is the former. 

This gives 


it-i -l>£-i + 2>£- (£t_i -1) + 2>L + 1- (£t_i - 1) + 2. 

Consequently, 


To see that (8) is contradicted, 

2£t_£ -L-5 = 

< 

Therefore, for a = t — 1, G has no cycle of length longer than £t_i — 1. 

□ 

Proof of Proposition 19. Let be a longest path in G such that viV£ G E{G). If G 

is not Hamiltonian, there exists aw E V (G) — {ui,..., V£}. Because G is connected, there 
exists i G {!,...,£} and w' G V{G) — {ui,... ,V£} such that Vi is adjacent to w'. Then 
there is a longer path: Vj+i • • • V£Vi ■ ■ ■ Viw', a contradiction. □ 

Proof of Lemma 20. 

Because Ui • • • is a longest path in F, neither vi nor Vk can have neighbors off this path, 
as that would yield a longer path. Thus N{vi) U N{v£) C . .., Vk} in F. 

Case \\ £ < £a. 

If Vi is adjacent to ui, then Vi-i cannot be adjacent to V£. Thus, the predecessors of 


2£t_i - L - 5 > 0. 


( 8 ) 


h' 


h 

Yt 


t 


-5 


, h 
2 — + 1 
' 2t 


^-1 

t 


5 = -2 < 0. 


17 



N{vi) and the neighbors of Vi are disjoint subsets in {ui,... ,n£_i}. Since both vi and 
have degree at least t'a+i, hence 

24+1 <^-i <4-1. 


However, 


4 - 24+1 - 1 = 


< 


■ h ■ 

_ 9 

■ h ■ 

t -|- d “h 1 


t d 2, 


h 2h h{t + a) 

t (I t Qi 2 


< 0 . 


( 9 ) 


Case 2: £ > 4 + 1. 

Partition the vertices of this path into 2s + 1 consecutive sets Aq, Bi, Ai,..., Ag, Bs 
with s > 0, constructed so that, in each set H*, neighbors of vi appear before neighbors 
of V£ as follows: 

We let neighbors of vi be denoted with Up. and neighbors of V£ be denoted with Vq^ 
in this construction. Let Aq contain vi and add consecutive vertices of this path until 
we arrive at a neighbor of V£. From this point forward we do not allow another neighbor 
of Vi to be in Hq, i.e. we continue adding consecutive vertices until we reach the last 
neighbor Vq^ of V£ before another neighbor Vp^ of Vi. Then Aq = {ui ,... ,Vqq}, and we 
dehne Bi = ..., Note that this dehnition does not preclude Bi being 

an empty set. Continuing with this algorithm, we dehne sets Ai = {upj,..., and 
B 2 = {'^gi+i, • • •, where Vp^ is a neighbor of vi on this path, Vq^ is the last neighbor 

of V£ in Ai before another neighbor Vp^ of vi as shown in Figure 1. We continue in this 
way and dehne sets A = {vp ^,...,%} and Bi = {vq^_^+l ,..., Vp^-l} for i e {!,..., s}, 
adding the last vertex vn into the set Ag. 



Figure 1: Partition of vertices of the path. Sets Ai are iteratively constructed so that 
they contain consecutive vertices of this path starting with a neighbor of vi and ending 
with the last neighbor of V£ so that no neighbor of Vi appears after neighbors of V£ in each 
set. Sets Bi contain consecutive vertices between sets Ai_i and Ai, if there are any. The 
hrst vertex is placed in Aq and the last vertex V£ in Ag. 

Now we analyze this partition: 
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• We call the sets Bi, i G {1,... ,s}, gaps as they do not contain any neighbors of 
either vi or but only contain vertices that succeed a given neighbor of vi and 
precede a given neighbor of vi. According to the dehnition, gaps may be empty, but 
we will see below that this is not possible in this case. 

• Each set Aj, i G {0,..., s}, contains at most one common neighbor of Vi and v^. 

• By construction, neighbors of Vi (other than a common neighbor, if exists) precede 
neighbors of in each A*, i G {0,..., s}. 

It will suffice to show that s = 0. This will imply that no neighbor of vi follows the 
Erst neighbor of V£ on this path, which further implies that N{vi) entirely precedes N{v£), 
except possibly for a single common vertex. Since Vi and vi have at least one common 
neighbor, the lemma will follow. 

Notice that vi-■-Vq^v^vi-i-■-Vp^vi is a cycle as seen in Figure 1. In fact, for any 
i > 1, removing the gap Bi from vertices {ui,..., forms a cycle, so by assumption, 
< C — 1 and none of the gaps can be empty. Therefore, X]i=i 1-^*1 — ~C + !)• 

On the other hand, by the degree assumption and since each set Aj contains at most 
one common neighbor of Vi and we obtain 2la+i < + l-^('^£)l < (Si=o 1^*1) + 

(s + 1) — 2. Combining these two inequalities we have 

S S 

i = 'y ^ |Aj| + y ^ \Bi\ > 2^a+l — (s + 1) + 2 + s{i — + 1) 

^=0 i=l 

= — to) + 2£a_|_i + 1. 


If s > 1, then we have £ > £ — + 2£a+i + 1 which simplifies to — 2^0+1 — 1 > 0, 

which is contradicted by (9). Therefore s = 0 and the lemma follows. □ 


Proof of Fact 21. We need to show that 7 'h(po) = Po/(^ + !)• Since 

7w(Po)=Po- min -- —I, 

aG{0,...,t} [t+1 a{l-po) + [ia “ 1) Po J 

we need to show that f — a + 1 for all a G {0,..., f — 1}. 

To do this, let h = q{2t + 1) — r where r G {0,..., 2t} and q >2 (because h > 2t + 2). 
Then, 


- 1 


< 


q - 1 
1 


q -1 + 


'q{t 


Q 


q-1 
= t — Cl “h 1 “h 


1 + 


t d \ 
q{t — a) + t + a 
t -|- d “h 1 
+ 2t — g(t^ - 


{q — 1) (t + a + 1) 

which is at most f — a + lifg^Sorifa^f — 2 and q = 2. In the case where a = t — 1 
and q = 2, then = 1+1" <2 = t — a + 1. □ 
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7 Conclusion and open questions 

We have obtained the edit distance function over all of its domain for when t + 1 does 
not divide h and h > 2t{t + 1) + 1. When t + 1 divides h and h > 2t{t + 1) + 1, we have 
obtained the function for p G [po, 1], where po = I" 2^1 • function, however, is not 

known when t + 1 divides h and p G [0,po) or when h < 2t(t + 1). 

As to the case of p < po (and h sufficiently large), we showed that if A' G /C(Forb(C^)) 
is a p-core CRG with p < 1/2 which has a ^ t — 1 white vertices, then gxip) = 7 Forb(c^) ip)- 
Therefore, to solve the problem for the remaining case when t + 1 divides h, and p is small, 
one only needs to consider CRGs with exactly t — 1 white vertices. A particular barrier 
to this is Lemma 17 which requires p > po to ensure that the graph induced by the black 
vertices and gray edges of the GRG has the property that any two vertices have at least 
one common neighbor. Such a condition need not hold for small p. 

As to reducing the lower bound required of h, we note that in the proof of Theorem 3, 
we required h > 2t{t + 1) + 1 in Fact 15. This ensured that the 7 ^ function for p G 
[ 1 / 2 , 1 ] was linear and by the concavity and continuity of the edit distance function (see 
Proposition 1), this ensures that ed^(p) = 7 ^(p) in that interval. So, more careful 
analysis of the case p > 1/2 may enable one to reduce the lower bound on h, but these 
arguments are very different from the case where p < 1/2. Elsewhere, we only require 
h > max{t{t — l),2t + 2} in order to complete the proof of Theorem 3. This bound is 
required in several places. See Fact 12, Lemma 13, Lemma 18 but especially the basic 
Fact 14 which says that a set of size h can be partitioned into sets of size t or t + 1 if 
and only if h > t{t — 1). So we believe that it would be difficult to prove the theorem for 
values of h smaller than max{t(t — 1 ), 2 t + 2 } in general. 
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